AITopics | performance evaluation

fc20ea8d104cab737a5561096f9bde9b-Supplemental-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems, May-1-2026, 05:15:41 GMT

AQuA currently includes a variety of datasets for different classification problems, varying in the number of classes, sources of annotations, and data modalities. All datasets except those marked with are multi-class.

artificial intelligence, machine learning, natural language, (15 more...)

Genre: Research Report (0.46)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

A Appendix

Neural Information Processing Systems, Feb-18-2026, 03:00:34 GMT

The classes are airplanes, cars, birds, cats, deer, dogs, frogs, horses, ships, and trucks, and they are all mutually exclusive.

artificial intelligence, machine learning, natural language, (17 more...)

Country:

North America > United States > Massachusetts (0.04)
North America > United States > Florida > Broward County (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.46)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

a1c716638d9b618a1a40a96f473c8250-Paper-Conference.pdf

Neural Information Processing Systems, Feb-16-2026, 06:03:11 GMT

adversarial attack, artificial intelligence, machine learning, (18 more...)

Country: Asia > China > Guangdong Province > Guangzhou (0.04)

Industry: Information Technology > Security & Privacy (0.73)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
(2 more...)

MathNAS: If Blocks Have a Role in Mathematical Architecture Design

Neural Information Processing Systems, Feb-15-2026, 23:22:06 GMT

However, designing large models with NAS is challenging because of the dramatic increase in the search space and the associated huge performance evaluation cost.

artificial intelligence, machine learning, natural language, (15 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)

63b2b056f48653b7cff0d8d233c96a4d-Supplemental-Conference.pdf

Neural Information Processing Systems, Feb-9-2026, 10:35:48 GMT

pearl, performance evaluation, trajectory pair, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Park, Sihyeong, Jeon, Sungryeol, Lee, Chaelyn, Jeon, Seokhun, Kim, Byung-Soo, Lee, Jemin

arXiv.org Artificial Intelligence, Nov-27-2025

Large language models (LLMs) are widely applied in chatbots, code generators, and search engines. Workloads such as chain-of-thought, complex reasoning, and agent services significantly increase inference cost by invoking the model repeatedly. Optimization methods such as parallelism, compression, and caching have been adopted to reduce costs, but diverse service requirements make it hard to select the right method. Recently, specialized LLM inference engines have emerged as a key component for integrating these optimization methods into service-oriented infrastructures. However, a systematic study of inference engines is still lacking. This paper provides a comprehensive evaluation of 25 open-source and commercial inference engines. We examine each inference engine in terms of ease of use, ease of deployment, general-purpose support, scalability, and suitability for throughput- and latency-aware computation. Furthermore, we explore the design goals of each inference engine by investigating the optimization techniques it supports. In addition, we assess the ecosystem maturity of open-source inference engines and address the performance and cost policy of commercial solutions. We outline future research directions that include support for complex LLM-based services, support for various hardware, and enhanced security, offering practical guidance to researchers and developers in selecting and designing optimized LLM inference engines. We also provide a public repository to continually track developments in this fast-evolving field: https://github.com/sihyeong/Awesome-LLM-Inference-Engine.

artificial intelligence, large language model, natural language, (20 more...)

2505.01658

Country: Asia > South Korea (0.28)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Workflow (0.92)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine (0.92)
Information Technology > Services (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Performance Evaluation of Bitstring Representations in a Linear Genetic Programming Framework

Meli, Clyde, Nezval, Vitezslav, Oplatkova, Zuzana Kominkova, Buttigieg, Victor, Staines, Anthony Spiteri

arXiv.org Artificial Intelligence, Nov-6-2025

Different bitstring representations can yield varying computational performance. This work compares three bitstring implementations in C++: std::bitset, boost::dynamic_bitset, and a custom direct implementation. Their performance is benchmarked in the context of concatenation within a Linear Genetic Programming system. Benchmarks were conducted on three platforms (macOS, Linux, and Windows MSYS2) to assess platform-specific performance variations. The results show that the custom direct implementation delivers the fastest performance on Linux and Windows, while std::bitset performs best on macOS. Although consistently slower, boost::dynamic_bitset remains a viable and flexible option. These findings highlight the influence of compiler optimisations and system architecture on performance, providing practical guidance for selecting the optimal method based on platform and application requirements.
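For readers unfamiliar with the operation being benchmarked, the sketch below shows what concatenating two bitstrings amounts to. It is a Python stand-in rather than any of the three C++ implementations the abstract compares, and the function name `concat_bits` and the (value, length) representation are assumptions made for illustration only.

```python
def concat_bits(a, a_len, b, b_len):
    """Concatenate two bitstrings stored as (integer value, bit length) pairs.

    The bits of `a` become the high-order bits of the result.
    """
    # Reject values that do not fit in their stated bit length.
    if a >> a_len or b >> b_len:
        raise ValueError("value does not fit in the stated length")
    # Shift `a` left to make room for `b`, then merge the two.
    return (a << b_len) | b, a_len + b_len


value, length = concat_bits(0b101, 3, 0b0011, 4)
print(format(value, f"0{length}b"))  # 1010011
```

The explicit length is carried alongside the value because leading zeros (as in `0011`) would otherwise be lost.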

evolutionary algorithm, machine learning, programming language, (17 more...)

2511.02897

Country:

Europe (0.16)
North America > United States (0.14)

Genre: Research Report (0.84)

Technology:

Information Technology > Software > Programming Languages (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.87)

Performance Evaluation of Ising and QUBO Variable Encodings in Boltzmann Machine Learning

Hasegawa, Yasushi, Ohzeki, Masayuki

arXiv.org Artificial Intelligence, Oct-16-2025

We compare Ising ({-1,+1}) and QUBO ({0,1}) encodings for Boltzmann machine learning under a controlled protocol that fixes the model, sampler, and step size. Exploiting the identity that the Fisher information matrix (FIM) equals the covariance of sufficient statistics, we visualize empirical moments from model samples and reveal systematic, representation-dependent differences. QUBO induces larger cross terms between first- and second-order statistics, creating more small-eigenvalue directions in the FIM and lowering spectral entropy. This ill-conditioning explains slower convergence under stochastic gradient descent (SGD). In contrast, natural gradient descent (NGD), which rescales updates by the FIM metric, achieves similar convergence across encodings due to reparameterization invariance. Practically, for SGD-based training, the Ising encoding provides more isotropic curvature and faster convergence; for QUBO, centering/scaling or NGD-style preconditioning mitigates curvature pathologies. These results clarify how representation shapes information geometry and finite-time learning dynamics in Boltzmann machines and yield actionable guidelines for variable encoding and preprocessing.
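The cross-term claim can be checked by hand in the simplest possible setting. The sketch below is not the authors' protocol: it assumes a fully connected model with three units and all parameters set to zero (so the model distribution is uniform), enumerates every state exactly, and computes the FIM as the covariance of the sufficient statistics (first-order terms v_i followed by second-order terms v_i v_j). The function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations, product


def sufficient_stats(state):
    """First-order terms v_i followed by second-order terms v_i * v_j (i < j)."""
    first = list(state)
    second = [state[i] * state[j] for i, j in combinations(range(len(state)), 2)]
    return first + second


def fisher_information_uniform(values, n):
    """Covariance of the sufficient statistics under the uniform distribution.

    For an exponential-family model the FIM equals this covariance; the
    uniform distribution is the assumed special case of all-zero parameters.
    """
    states = list(product(values, repeat=n))
    weight = Fraction(1, len(states))
    stats = [sufficient_stats(s) for s in states]
    dim = len(stats[0])
    mean = [sum(weight * t[k] for t in stats) for k in range(dim)]
    return [
        [sum(weight * (t[a] - mean[a]) * (t[b] - mean[b]) for t in stats)
         for b in range(dim)]
        for a in range(dim)
    ]


n = 3
fim_ising = fisher_information_uniform((-1, 1), n)  # Ising encoding
fim_qubo = fisher_information_uniform((0, 1), n)    # QUBO encoding

# Entry [0][3] is the covariance between v_0 and v_0 * v_1.
print(fim_ising[0][3], fim_qubo[0][3])  # 0 1/8
```

In this zero-parameter case the Ising FIM is the identity matrix, while the QUBO FIM has a covariance of 1/8 between x_0 and x_0 x_1, a small instance of the first-order/second-order cross terms the abstract describes.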

artificial intelligence, machine learning, qubo, (15 more...)

2510.1321

Country: Asia > Japan > Honshū > Tōhoku (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.85)

Vulnerabilities in Video Quality Assessment Models: The Challenge of Adversarial Attacks

Zhang, Ao-Xiang, Ran, Yu, Tang, Weixuan, Wang, Yuan-Gen

Neural Information Processing Systems, Oct-9-2025, 03:14:36 GMT

No-Reference Video Quality Assessment (NR-VQA) plays an essential role in improving the viewing experience of end-users. Driven by deep learning, recent NR-VQA models based on Convolutional Neural Networks (CNNs) and Transformers have achieved outstanding performance.

adversarial attack, artificial intelligence, machine learning, (18 more...)

Country: Asia > China > Guangdong Province > Guangzhou (0.04)

Industry: